Exploration server for computer science research in Lorraine

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Accuracy of a markerless acquisition technique for studying speech articulators. In Interspeech 2015

Internal identifier: 000344 (Main/Exploration); previous: 000343; next: 000345

Authors: Andrea Bandini [Italy]; Slim Ouni [France]; Piero Cosi [Italy]; Silvia Orlandi [Italy]; Claudia Manfredi [Italy]

RBID : Hal:hal-01189000

Abstract

The main disadvantages of existing methods for studying speech articulators (such as electromagnetic and optoelectronic systems) are their high cost and the discomfort they cause to participants or patients. The aim of this work is to introduce a completely markerless, low-cost 3D tracking technique in the context of speech articulation, and then to compare it with a well-established marker-based technique to evaluate its performance. A Kinect-like device was used in conjunction with an existing face-tracking algorithm to track lip movements in 3D without markers. The method was tested on two subjects uttering 200 words and 100 sentences. For most points on the lips, the RMSE ranged between 1 and 3 mm. Although the image resolution used in this experiment was low, these results are very promising. Nevertheless, further studies should consider higher video resolutions in order to obtain better results.
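
The RMSE figure quoted in the abstract can be illustrated with a minimal sketch. The abstract does not say how the two systems were aligned or whether the error was computed per coordinate or as a 3D distance, so the Euclidean definition below, the function name `rmse_mm`, and the sample coordinates are all assumptions made for illustration, not the authors' procedure.

```python
import math


def rmse_mm(tracked, reference):
    """Root-mean-square of the 3D Euclidean error between two trajectories.

    Both arguments are sequences of (x, y, z) positions in millimetres,
    one per frame, assumed to be time-aligned and in the same coordinate frame.
    """
    if len(tracked) != len(reference) or not tracked:
        raise ValueError("trajectories must be non-empty and of equal length")
    # Squared Euclidean distance between the two systems at each frame.
    squared_errors = [
        sum((a - b) ** 2 for a, b in zip(p, q))
        for p, q in zip(tracked, reference)
    ]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical positions of one lip point over three frames (not real data).
markerless = [(1.0, 2.0, 3.0), (4.0, 5.0, 6.0), (7.0, 8.0, 9.0)]
marker_based = [(3.0, 2.0, 3.0), (4.0, 3.0, 6.0), (7.0, 8.0, 11.0)]

print(rmse_mm(markerless, marker_based))  # prints 2.0
```

In this toy input every frame is off by exactly 2 mm along one axis, so the result is 2 mm, which falls inside the 1 to 3 mm range reported for most lip points.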

The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Accuracy of a markerless acquisition technique for studying speech articulators. In Interspeech 2015</title>
<author>
<name sortKey="Bandini, Andrea" sort="Bandini, Andrea" uniqKey="Bandini A" first="Andrea" last="Bandini">Andrea Bandini</name>
<affiliation wicri:level="1">
<hal:affiliation type="institution" xml:id="struct-30978" status="VALID">
<orgName>Università di Bologna [Bologna]</orgName>
<orgName type="acronym">UNIBO</orgName>
<desc>
<address>
<addrLine>Via Zamboni, 33 - 40126 Bologna</addrLine>
<country key="IT"></country>
</address>
<ref type="url">http://www.eng.unibo.it/PortaleEn/default.htm</ref>
</desc>
</hal:affiliation>
<country>Italie</country>
</affiliation>
</author>
<author>
<name sortKey="Ouni, Slim" sort="Ouni, Slim" uniqKey="Ouni S" first="Slim" last="Ouni">Slim Ouni</name>
<affiliation wicri:level="1">
<hal:affiliation type="researchteam" xml:id="struct-420403" status="VALID">
<idno type="RNSR">201421147E</idno>
<orgName>Speech Modeling for Facilitating Oral-Based Communication</orgName>
<orgName type="acronym">MULTISPEECH</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/equipes/multispeech</ref>
</desc>
<listRelation>
<relation active="#struct-129671" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-423086" type="direct"></relation>
<relation active="#struct-206040" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-129671" type="direct">
<org type="laboratory" xml:id="struct-129671" status="VALID">
<idno type="RNSR">198618246Y</idno>
<orgName>INRIA Nancy - Grand Est</orgName>
<desc>
<address>
<addrLine>615 rue du Jardin Botanique 54600 Villers-lès-Nancy</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/nancy</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect">
<org type="institution" xml:id="struct-300009" status="VALID">
<orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc>
<address>
<addrLine>Domaine de Voluceau, Rocquencourt - BP 105, 78153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-423086" type="direct">
<org type="department" xml:id="struct-423086" status="VALID">
<orgName>Department of Natural Language Processing &amp; Knowledge Discovery</orgName>
<orgName type="acronym">LORIA - NLPKD</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr/la-recherche-en/departements/Knowledge-and-Language-Management</ref>
</desc>
<listRelation>
<relation active="#struct-206040" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-206040" type="indirect">
<org type="laboratory" xml:id="struct-206040" status="VALID">
<idno type="IdRef">067077927</idno>
<idno type="RNSR">198912571S</idno>
<idno type="IdUnivLorraine">[UL]RSI--</idno>
<orgName>Laboratoire Lorrain de Recherche en Informatique et ses Applications</orgName>
<orgName type="acronym">LORIA</orgName>
<date type="start">2012-01-01</date>
<desc>
<address>
<addrLine>Campus Scientifique BP 239 54506 Vandoeuvre-lès-Nancy Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
<relation active="#struct-413289" type="direct"></relation>
<relation name="UMR7503" active="#struct-441569" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-413289" type="indirect">
<org type="institution" xml:id="struct-413289" status="VALID">
<idno type="IdRef">157040569</idno>
<idno type="IdUnivLorraine">[UL]100--</idno>
<orgName>Université de Lorraine</orgName>
<orgName type="acronym">UL</orgName>
<date type="start">2012-01-01</date>
<desc>
<address>
<addrLine>34 cours Léopold - CS 25233 - 54052 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lorraine.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="UMR7503" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Nancy</settlement>
<settlement type="city">Metz</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="university">Université de Lorraine</orgName>
</affiliation>
</author>
<author>
<name sortKey="Cosi, Piero" sort="Cosi, Piero" uniqKey="Cosi P" first="Piero" last="Cosi">Piero Cosi</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-243458" status="INCOMING">
<orgName>Institute of Cognitive Sciences and Technologies</orgName>
<desc>
<address>
<addrLine>Via alla Cascata 56C, 38123 Povo (TN)</addrLine>
<country key="IT"></country>
</address>
<ref type="url">http://www.istc.cnr.it/</ref>
</desc>
<listRelation>
<relation active="#struct-302223" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-302223" type="direct">
<org type="institution" xml:id="struct-302223" status="VALID">
<orgName>Consiglio Nazionale delle Ricerche [Roma]</orgName>
<orgName type="acronym">CNR</orgName>
<desc>
<address>
<addrLine>Piazzale Aldo Moro,7 - 00185, Roma</addrLine>
<country key="IT"></country>
</address>
<ref type="url">http://www.cnr.it/sitocnr/home.html</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Italie</country>
</affiliation>
</author>
<author>
<name sortKey="Orlandi, Silvia" sort="Orlandi, Silvia" uniqKey="Orlandi S" first="Silvia" last="Orlandi">Silvia Orlandi</name>
<affiliation wicri:level="1">
<hal:affiliation type="institution" xml:id="struct-148408" status="VALID">
<orgName>Università degli Studi di Firenze [Firenze]</orgName>
<desc>
<address>
<addrLine>P.zza S.Marco, 4 - 50121 Firenze</addrLine>
<country key="IT"></country>
</address>
<ref type="url">http://www.unifi.it/</ref>
</desc>
</hal:affiliation>
<country>Italie</country>
</affiliation>
</author>
<author>
<name sortKey="Manfredi, Claudia" sort="Manfredi, Claudia" uniqKey="Manfredi C" first="Claudia" last="Manfredi">Claudia Manfredi</name>
<affiliation wicri:level="1">
<hal:affiliation type="institution" xml:id="struct-148408" status="VALID">
<orgName>Università degli Studi di Firenze [Firenze]</orgName>
<desc>
<address>
<addrLine>P.zza S.Marco, 4 - 50121 Firenze</addrLine>
<country key="IT"></country>
</address>
<ref type="url">http://www.unifi.it/</ref>
</desc>
</hal:affiliation>
<country>Italie</country>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:hal-01189000</idno>
<idno type="halId">hal-01189000</idno>
<idno type="halUri">https://hal.inria.fr/hal-01189000</idno>
<idno type="url">https://hal.inria.fr/hal-01189000</idno>
<date when="2015-09-06">2015-09-06</date>
<idno type="wicri:Area/Hal/Corpus">000A65</idno>
<idno type="wicri:Area/Hal/Curation">000A65</idno>
<idno type="wicri:Area/Hal/Checkpoint">000320</idno>
<idno type="wicri:explorRef" wicri:stream="Hal" wicri:step="Checkpoint">000320</idno>
<idno type="wicri:Area/Main/Merge">000344</idno>
<idno type="wicri:Area/Main/Curation">000344</idno>
<idno type="wicri:Area/Main/Exploration">000344</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Accuracy of a markerless acquisition technique for studying speech articulators. In Interspeech 2015</title>
<author>
<name sortKey="Bandini, Andrea" sort="Bandini, Andrea" uniqKey="Bandini A" first="Andrea" last="Bandini">Andrea Bandini</name>
<affiliation wicri:level="1">
<hal:affiliation type="institution" xml:id="struct-30978" status="VALID">
<orgName>Università di Bologna [Bologna]</orgName>
<orgName type="acronym">UNIBO</orgName>
<desc>
<address>
<addrLine>Via Zamboni, 33 - 40126 Bologna</addrLine>
<country key="IT"></country>
</address>
<ref type="url">http://www.eng.unibo.it/PortaleEn/default.htm</ref>
</desc>
</hal:affiliation>
<country>Italie</country>
</affiliation>
</author>
<author>
<name sortKey="Ouni, Slim" sort="Ouni, Slim" uniqKey="Ouni S" first="Slim" last="Ouni">Slim Ouni</name>
<affiliation wicri:level="1">
<hal:affiliation type="researchteam" xml:id="struct-420403" status="VALID">
<idno type="RNSR">201421147E</idno>
<orgName>Speech Modeling for Facilitating Oral-Based Communication</orgName>
<orgName type="acronym">MULTISPEECH</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/equipes/multispeech</ref>
</desc>
<listRelation>
<relation active="#struct-129671" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-423086" type="direct"></relation>
<relation active="#struct-206040" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-129671" type="direct">
<org type="laboratory" xml:id="struct-129671" status="VALID">
<idno type="RNSR">198618246Y</idno>
<orgName>INRIA Nancy - Grand Est</orgName>
<desc>
<address>
<addrLine>615 rue du Jardin Botanique 54600 Villers-lès-Nancy</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/nancy</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-300009" type="indirect">
<org type="institution" xml:id="struct-300009" status="VALID">
<orgName>Institut National de Recherche en Informatique et en Automatique</orgName>
<orgName type="acronym">Inria</orgName>
<desc>
<address>
<addrLine>Domaine de Voluceau, Rocquencourt - BP 105, 78153 Le Chesnay Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.inria.fr/en/</ref>
</desc>
</org>
</tutelle>
<tutelle active="#struct-423086" type="direct">
<org type="department" xml:id="struct-423086" status="VALID">
<orgName>Department of Natural Language Processing &amp; Knowledge Discovery</orgName>
<orgName type="acronym">LORIA - NLPKD</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr/la-recherche-en/departements/Knowledge-and-Language-Management</ref>
</desc>
<listRelation>
<relation active="#struct-206040" type="direct"></relation>
<relation active="#struct-300009" type="indirect"></relation>
<relation active="#struct-413289" type="indirect"></relation>
<relation name="UMR7503" active="#struct-441569" type="indirect"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-206040" type="indirect">
<org type="laboratory" xml:id="struct-206040" status="VALID">
<idno type="IdRef">067077927</idno>
<idno type="RNSR">198912571S</idno>
<idno type="IdUnivLorraine">[UL]RSI--</idno>
<orgName>Laboratoire Lorrain de Recherche en Informatique et ses Applications</orgName>
<orgName type="acronym">LORIA</orgName>
<date type="start">2012-01-01</date>
<desc>
<address>
<addrLine>Campus Scientifique BP 239 54506 Vandoeuvre-lès-Nancy Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.loria.fr</ref>
</desc>
<listRelation>
<relation active="#struct-300009" type="direct"></relation>
<relation active="#struct-413289" type="direct"></relation>
<relation name="UMR7503" active="#struct-441569" type="direct"></relation>
</listRelation>
</org>
</tutelle>
<tutelle active="#struct-413289" type="indirect">
<org type="institution" xml:id="struct-413289" status="VALID">
<idno type="IdRef">157040569</idno>
<idno type="IdUnivLorraine">[UL]100--</idno>
<orgName>Université de Lorraine</orgName>
<orgName type="acronym">UL</orgName>
<date type="start">2012-01-01</date>
<desc>
<address>
<addrLine>34 cours Léopold - CS 25233 - 54052 Nancy cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lorraine.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="UMR7503" active="#struct-441569" type="indirect">
<org type="institution" xml:id="struct-441569" status="VALID">
<idno type="ISNI">0000000122597504</idno>
<idno type="IdRef">02636817X</idno>
<orgName>Centre National de la Recherche Scientifique</orgName>
<orgName type="acronym">CNRS</orgName>
<date type="start">1939-10-19</date>
<desc>
<address>
<country key="FR"></country>
</address>
<ref type="url">http://www.cnrs.fr/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Nancy</settlement>
<settlement type="city">Metz</settlement>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
</placeName>
<orgName type="university">Université de Lorraine</orgName>
</affiliation>
</author>
<author>
<name sortKey="Cosi, Piero" sort="Cosi, Piero" uniqKey="Cosi P" first="Piero" last="Cosi">Piero Cosi</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-243458" status="INCOMING">
<orgName>Institute of Cognitive Sciences and Technologies</orgName>
<desc>
<address>
<addrLine>Via alla Cascata 56C, 38123 Povo (TN)</addrLine>
<country key="IT"></country>
</address>
<ref type="url">http://www.istc.cnr.it/</ref>
</desc>
<listRelation>
<relation active="#struct-302223" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-302223" type="direct">
<org type="institution" xml:id="struct-302223" status="VALID">
<orgName>Consiglio Nazionale delle Ricerche [Roma]</orgName>
<orgName type="acronym">CNR</orgName>
<desc>
<address>
<addrLine>Piazzale Aldo Moro,7 - 00185, Roma</addrLine>
<country key="IT"></country>
</address>
<ref type="url">http://www.cnr.it/sitocnr/home.html</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Italie</country>
</affiliation>
</author>
<author>
<name sortKey="Orlandi, Silvia" sort="Orlandi, Silvia" uniqKey="Orlandi S" first="Silvia" last="Orlandi">Silvia Orlandi</name>
<affiliation wicri:level="1">
<hal:affiliation type="institution" xml:id="struct-148408" status="VALID">
<orgName>Università degli Studi di Firenze [Firenze]</orgName>
<desc>
<address>
<addrLine>P.zza S.Marco, 4 - 50121 Firenze</addrLine>
<country key="IT"></country>
</address>
<ref type="url">http://www.unifi.it/</ref>
</desc>
</hal:affiliation>
<country>Italie</country>
</affiliation>
</author>
<author>
<name sortKey="Manfredi, Claudia" sort="Manfredi, Claudia" uniqKey="Manfredi C" first="Claudia" last="Manfredi">Claudia Manfredi</name>
<affiliation wicri:level="1">
<hal:affiliation type="institution" xml:id="struct-148408" status="VALID">
<orgName>Università degli Studi di Firenze [Firenze]</orgName>
<desc>
<address>
<addrLine>P.zza S.Marco, 4 - 50121 Firenze</addrLine>
<country key="IT"></country>
</address>
<ref type="url">http://www.unifi.it/</ref>
</desc>
</hal:affiliation>
<country>Italie</country>
</affiliation>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass></textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">The main disadvantages of existing methods for studying speech articulators (such as electromagnetic and optoelectronic systems) are their high cost and the discomfort they cause to participants or patients. The aim of this work is to introduce a completely markerless, low-cost 3D tracking technique in the context of speech articulation, and then to compare it with a well-established marker-based technique to evaluate its performance. A Kinect-like device was used in conjunction with an existing face-tracking algorithm to track lip movements in 3D without markers. The method was tested on two subjects uttering 200 words and 100 sentences. For most points on the lips, the RMSE ranged between 1 and 3 mm. Although the image resolution used in this experiment was low, these results are very promising. Nevertheless, further studies should consider higher video resolutions in order to obtain better results.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
<li>Italie</li>
</country>
<region>
<li>Grand Est</li>
<li>Lorraine (région)</li>
</region>
<settlement>
<li>Metz</li>
<li>Nancy</li>
</settlement>
<orgName>
<li>Université de Lorraine</li>
</orgName>
</list>
<tree>
<country name="Italie">
<noRegion>
<name sortKey="Bandini, Andrea" sort="Bandini, Andrea" uniqKey="Bandini A" first="Andrea" last="Bandini">Andrea Bandini</name>
</noRegion>
<name sortKey="Cosi, Piero" sort="Cosi, Piero" uniqKey="Cosi P" first="Piero" last="Cosi">Piero Cosi</name>
<name sortKey="Manfredi, Claudia" sort="Manfredi, Claudia" uniqKey="Manfredi C" first="Claudia" last="Manfredi">Claudia Manfredi</name>
<name sortKey="Orlandi, Silvia" sort="Orlandi, Silvia" uniqKey="Orlandi S" first="Silvia" last="Orlandi">Silvia Orlandi</name>
</country>
<country name="France">
<region name="Grand Est">
<name sortKey="Ouni, Slim" sort="Ouni, Slim" uniqKey="Ouni S" first="Slim" last="Ouni">Slim Ouni</name>
</region>
</country>
</tree>
</affiliations>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000344 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000344 | SxmlIndent | more
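
Outside Dilib, the `<affiliations>` part of the record can also be read with a general-purpose XML parser. The sketch below uses Python's standard library on a fragment copied from the record's `<tree>` element, with the `name` attributes abbreviated; the function name `authors_by_country` is hypothetical. Only this fragment is parsed because it carries no namespace prefixes, whereas the full record as displayed uses `wicri:` and `hal:` prefixes without declaring them, so a namespace-aware parser would likely need those declarations added first.

```python
import xml.etree.ElementTree as ET

# Fragment of the record's <tree> element (attributes abbreviated for brevity).
TREE_XML = """
<tree>
<country name="Italie">
<noRegion>
<name first="Andrea" last="Bandini">Andrea Bandini</name>
</noRegion>
<name first="Piero" last="Cosi">Piero Cosi</name>
<name first="Claudia" last="Manfredi">Claudia Manfredi</name>
<name first="Silvia" last="Orlandi">Silvia Orlandi</name>
</country>
<country name="France">
<region name="Grand Est">
<name first="Slim" last="Ouni">Slim Ouni</name>
</region>
</country>
</tree>
"""


def authors_by_country(tree_xml):
    """Map each country name to the author names listed beneath it."""
    root = ET.fromstring(tree_xml.strip())
    result = {}
    for country in root.findall("country"):
        # iter() also descends into the <noRegion> and <region> wrappers.
        result[country.get("name")] = [n.text for n in country.iter("name")]
    return result


groups = authors_by_country(TREE_XML)
for country, names in groups.items():
    print(f"{country}: {', '.join(names)}")
```

Country labels are kept exactly as they appear in the record (for example `Italie`), since they are data values rather than display text.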

To link to this page from within the Wicri network

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Hal:hal-01189000
   |texte=   Accuracy of a markerless acquisition technique for studying speech articulators. In Interspeech 2015
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022